18. 


PROOF OF THE HITHERTO UNDEMONSTRATED FUNDA- 
MENTAL THEOREM OF INVARIANTS. 


[Philosophical Magazine, v. (1878), pp. 178—188.] 


I AM about to demonstrate a theorem which has been waiting proof for 
the last quarter of a century and upwards. It is the more necessary that this 
should be done, because the theorem has been supposed to lead to false conclu- 
sions, and its correctness has consequently been impugned*. But, of the two 
suppositions that might be made to account for the observed discrepancy
between the supposed consequences of the theorem and ascertained facts—one 
that the theorem is false and the reasoning applied to it correct, the other 
that the theorem is true but that an error was committed in drawing certain 
deductions from it (to which one might add a third, of the theorem and the 
reasoning upon it being both erroneous)—the wrong alternative was chosen. 


* Thus in Professor Faà de Bruno’s valuable Théorie des Formes Binaires, Turin, 1876, at the 
foot of page 150 occurs the following passage: “Cela suppose essentiellement que les équations de
condition soient toutes indépendantes entr’elles, ce qui n’est pas toujours le cas, ainsi qu’il résulte
des recherches du Prof. Gordan sur les nombres des covariants des formes quintique et sextique.”

The reader is cautioned against supposing that the consequence alleged above does result from 
Gordan’s researches, which are indubitably correct. This supposed consequence must have 
arisen from a misapprehension on the part of M. de Bruno of the nature of Professor Cayley’s 
rectification of the error of reasoning contained in his second memoir on Quantics, which had led 
to results discordant with Gordan’s. Thus error breeds error, unless and until the pernicious
brood is stamped out for good and all under the iron heel of rigid demonstration. In the early 
part of this year Mr Halsted, a Fellow of Johns Hopkins University, called my attention to 
this passage in M. de Bruno’s book; and all I could say in reply was that ‘‘the extrinsic evidence 
in support of the independence of the equations which had been impugned rendered it to my
mind as certain as any fact in nature could be, but that to reduce it to an exact demonstration 
transcended, I thought, the powers of the human understanding.” 

At the moment of completing a memoir, to appear in Borchardt’s Journal, demonstrating my 
quarter-of-a-century-old theorem for enabling Invariants to procreate their species, as well by an act
of self-fertilization as by conjugation of arbitrarily paired forms, the unhoped and unsought-for
prize fell into my lap, and I accomplished with scarcely an effort a task which I had believed lay
outside the range of human power.

An error was committed in reasoning out certain supposed consequences of
the theorem; but the theorem itself is perfectly true, as I shall show by an
argument so irrefragable that it must be considered for ever hereafter safe 
from all doubt or cavil. It lies at the basis of the investigations begun 
by Professor Cayley in his Second Memoir on Quantics, which it has fallen 
to my lot, with no small labour and contention of mind, to lead to a happy 
issue, and thereby to advance the standards of the Science of Algebraical 
Forms to the most advanced point that has hitherto been reached. The 
stone that was rejected by the builders has become the chief corner-stone 
of the building. 


I shall for greater clearness begin with the case of a single binary quantic
$(a, b, c, \ldots, l)(x, y)^i$. Any rational integral function of the elements $a, b, c, \ldots, l$
which remains unchanged in value when for them are substituted the elements
of the new quantic obtained by putting $x + hy$ instead of $x$ in the original
one, I call a Differentiant in $x$ to the given quantic.


By a differentiant of a given weight $w$ and order $j$, I mean one in every
term of which the combination of the elements is of the $j$th order and the sum
of their weights $w$, the weights of the successive elements $(a, b, c, \ldots, l)$ themselves
being reckoned as $0, 1, 2, \ldots, i$ respectively.


The proposition to be proved is, that the number of arbitrary constants in
the most general expression for such differentiant is the number of ways in
which $w$ can be made up with $j$ of the integers $0, 1, 2, 3, \ldots, i$
(repetitions allowable), less the number of ways in which $w - 1$ can be made up
with the same integers. We may denote these two numbers by $(w : i, j)$,
$\{(w-1) : i, j\}$ respectively, and their difference by $\Delta(w : i, j)$. Then, if we call
the number of arbitrary constants in the differentiant of weight $w$ and order $j$
belonging to a binary quantic of the $i$th order $D(w : i, j)$, the proposition to be
established is that $D(w : i, j) = \Delta(w : i, j)$.
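The two counts entering this proposition lend themselves to a mechanical check. What follows is only a minimal sketch in Python, not anything taken from the memoir: the function names `ways` and `delta` are hypothetical, chosen here to stand for $(w : i, j)$ and $\Delta(w : i, j)$ respectively.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def ways(w, i, j):
    """(w : i, j): ways of making up w with j of the integers 0..i,
    repetitions allowed and order disregarded."""
    if w < 0:
        return 0
    if j == 0 or i == 0:
        # No parts left, or every remaining part is 0: the sum must be 0.
        return 1 if w == 0 else 0
    # Either no part equals i, or one part equal to i is removed.
    return ways(w, i - 1, j) + ways(w - i, i, j - 1)


def delta(w, i, j):
    """Delta(w : i, j) = (w : i, j) - ((w - 1) : i, j)."""
    return ways(w, i, j) - ways(w - 1, i, j)


# Quartic (i = 4), order j = 2, weight w = 4.
print(ways(4, 4, 2), ways(3, 4, 2), delta(4, 4, 2))
```

For the quartic with $j = 2$ and $w = 4$ the three ways are $0+4$, $1+3$, $2+2$, and the two ways for $w - 1 = 3$ are $0+3$, $1+2$, so that the difference is 1.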

Let us use $\Omega$ to denote the operator

$$a\frac{d}{db} + 2b\frac{d}{dc} + 3c\frac{d}{dd} + \ldots + i\,k\frac{d}{dl},$$

and $O$ to denote the operator

$$i\,b\frac{d}{da} + (i-1)\,c\frac{d}{db} + (i-2)\,d\frac{d}{dc} + \ldots + l\frac{d}{dk}.$$

Then it is well known that the necessary and sufficient condition for $D$ being
a differentiant in $x$ is that the identity $\Omega D = 0$ be satisfied.

Let us study the relations of $\Omega$ and $O$ in respect to $D$.

In the first place, let $U$ be any rational integral function of the elements
of order $j$ and weight $w$; then I say that

$$\Omega . O . U - O . \Omega . U = (ij - 2w)\,U.$$

For if we use $*$ to signify the act of pure differential operation, it is obvious that
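
$$\Omega . O . U = (\Omega \times O)\,U + (\Omega * O)\,U,$$
$$O . \Omega . U = (\Omega \times O)\,U + (O * \Omega)\,U;$$

therefore

$$\begin{aligned}
\Omega . O - O . \Omega &= (\Omega * O) - (O * \Omega) \\
&= i\,a\frac{d}{da} + 2(i-1)\,b\frac{d}{db} + 3(i-2)\,c\frac{d}{dc} + \ldots - \left(i\,b\frac{d}{db} + 2(i-1)\,c\frac{d}{dc} + \ldots\right) \\
&= i\,a\frac{d}{da} + (i-2)\,b\frac{d}{db} + (i-4)\,c\frac{d}{dc} + \ldots - i\,l\frac{d}{dl}.
\end{aligned}$$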


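If now $P\,a^p b^q c^r \ldots l^t$, where $P$ is a number, be any term in $U$, we have

$$p + q + r + \ldots + t = j, \qquad q + 2r + \ldots + i\,t = w,$$

by hypothesis; therefore $\Omega . O . U - O . \Omega . U$, that is

$$i\left(a\frac{d}{da} + b\frac{d}{db} + c\frac{d}{dc} + \ldots + l\frac{d}{dl}\right)U - 2\left(b\frac{d}{db} + 2c\frac{d}{dc} + \ldots + i\,l\frac{d}{dl}\right)U,$$

$$= \Sigma P\,(ij - 2w)\,(a^p b^q c^r \ldots l^t)$$
$$= (ij - 2w)\,U, \text{ as was to be proved.}$$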
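The identity just proved may be tried on a particular form. The following is a minimal sketch in Python under stated assumptions: a polynomial in the elements is stored as a dictionary from exponent tuples to coefficients, and the helper names `shift`, `omega`, `big_o`, `sub`, `scale` and `commutator` are hypothetical, introduced only for this illustration. The trial form $U = 2ac + 3b^2$ for the cubic ($i = 3$, $j = 2$, $w = 2$) is likewise an arbitrary choice.

```python
def shift(poly, moves):
    """Apply the operator  sum of c * a_dst * d/da_src  over (c, src, dst) in moves."""
    out = {}
    for exps, coef in poly.items():
        for c, src, dst in moves:
            if exps[src] == 0:
                continue
            e = list(exps)
            factor = coef * c * e[src]   # differentiate with respect to a_src
            e[src] -= 1
            e[dst] += 1                  # multiply by a_dst
            key = tuple(e)
            out[key] = out.get(key, 0) + factor
    return {m: v for m, v in out.items() if v != 0}


def omega(poly, i):
    """Omega = a d/db + 2b d/dc + 3c d/dd + ... (lowers the weight by 1)."""
    return shift(poly, [(k, k, k - 1) for k in range(1, i + 1)])


def big_o(poly, i):
    """O = i b d/da + (i-1) c d/db + ... (raises the weight by 1)."""
    return shift(poly, [(i - k, k, k + 1) for k in range(i)])


def sub(p, q):
    out = dict(p)
    for m, c in q.items():
        out[m] = out.get(m, 0) - c
    return {m: c for m, c in out.items() if c != 0}


def scale(p, s):
    return {m: s * c for m, c in p.items() if s * c != 0}


def commutator(poly, i):
    """Omega.O.U - O.Omega.U"""
    return sub(omega(big_o(poly, i), i), big_o(omega(poly, i), i))


# Cubic: elements (a, b, c, d); U = 2ac + 3b^2 has order j = 2 and weight w = 2.
i, j, w = 3, 2, 2
U = {(1, 0, 1, 0): 2, (0, 2, 0, 0): 3}
print(commutator(U, i) == scale(U, i * j - 2 * w))
```

Here $ij - 2w = 2$, and working the two products out by hand gives $\Omega . O . U = 24ac + 36b^2$ and $O . \Omega . U = 20ac + 30b^2$, whose difference is $2U$.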


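If now for $U$ we write $D$, a differentiant in $x$, we have $\Omega D = 0$, and therefore

$$\Omega . O . D = \delta D,$$

where $\delta = ij - 2w$.

Again,

$$\Omega . O\,(O . D) - O . \Omega\,(O . D) = \{ij - 2(w+1)\}\,O . D;$$

for $O . D$ is of the weight $w + 1$; therefore

$$\begin{aligned}
\Omega^2 . O^2 . D &= \Omega . O\,\delta D + (\delta - 2)\,\Omega . O . D \\
&= (2\delta - 2)\,\Omega . O . D \\
&= \delta(2\delta - 2)\,D.
\end{aligned}$$

Similarly it will be seen that

$$\Omega^3 . O^3 . D = \delta(2\delta - 2)(3\delta - 6)\,D,$$

and in general

$$\begin{aligned}
\Omega^q . O^q . D &= \delta(2\delta - 2)(3\delta - 6) \ldots \{q\delta - (q^2 - q)\}\,D \\
&= (1 . 2 . 3 \ldots q)\,\delta(\delta - 1)(\delta - 2) \ldots (\delta - q + 1)\,D,
\end{aligned}$$

the successive numbers $\delta$, $2\delta - 2$, $3\delta - 6$, &c. being the successive sums of the
arithmetical series $\delta$, $\delta - 2$, $\delta - 4$, $\delta - 6$, &c.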
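The general formula may be illustrated on the simplest non-trivial case. The sketch below is a minimal one in Python, with hypothetical helper names (`shift`, `omega`, `big_o`, `power`, `scale`, `falling`) and with the differentiant $D = ac - b^2$ of the cubic ($i = 3$, $j = 2$, $w = 2$, so $\delta = 2$) chosen here purely as an example. Polynomials are again dictionaries from exponent tuples to coefficients.

```python
from math import factorial


def shift(poly, moves):
    """Apply  sum of c * a_dst * d/da_src  over (c, src, dst) in moves."""
    out = {}
    for exps, coef in poly.items():
        for c, src, dst in moves:
            if exps[src] == 0:
                continue
            e = list(exps)
            factor = coef * c * e[src]
            e[src] -= 1
            e[dst] += 1
            out[tuple(e)] = out.get(tuple(e), 0) + factor
    return {m: v for m, v in out.items() if v != 0}


def omega(poly, i):
    """Omega = a d/db + 2b d/dc + ..."""
    return shift(poly, [(k, k, k - 1) for k in range(1, i + 1)])


def big_o(poly, i):
    """O = i b d/da + (i-1) c d/db + ..."""
    return shift(poly, [(i - k, k, k + 1) for k in range(i)])


def power(op, poly, i, q):
    """Apply the operator op q times in succession."""
    for _ in range(q):
        poly = op(poly, i)
    return poly


def scale(p, s):
    return {m: s * c for m, c in p.items() if s * c != 0}


def falling(delta, q):
    """(1.2.3...q) * delta (delta - 1) ... (delta - q + 1)"""
    result = factorial(q)
    for t in range(q):
        result *= delta - t
    return result


i, j, w = 3, 2, 2
delta = i * j - 2 * w                       # the quantity ij - 2w, here 2
D = {(1, 0, 1, 0): 1, (0, 2, 0, 0): -1}     # ac - b^2, annihilated by Omega
for q in range(4):
    lhs = power(omega, power(big_o, D, i, q), i, q)
    print(q, lhs == scale(D, falling(delta, q)))
```

For $q = 3$ both sides vanish, the factor $\delta - q + 1$ being then zero.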
To find the most general differentiant in question, we must take every
combination of the elements whose weight is $w$ and order $j$, of which the
number is obviously $(w : i, j)$, and prefix an indeterminate constant to each
such combination; then operating upon this form with $\Omega$, we shall reduce
its weight by unity, and shall obtain as many combinations of this reduced
weight (the order $j$ remaining unchanged) as there are units in $\{(w-1) : i, j\}$.
Each of these combinations will have for its coefficient a linear function of the
assumed indeterminate coefficients; and in order to satisfy the identity
$\Omega D = 0$, each such linear function must be made equal to zero. There are
therefore $(w : i, j)$ quantities connected by $\{(w-1) : i, j\}$ homogeneous equations.
Supposing the equations to be independent, the number of the indeterminate
coefficients left arbitrary is obviously the difference between these quantities,
namely, $\Delta(w : i, j)$. The difficulty consists in proving this independence, a
difficulty so great that I think any one attempting to establish the theorem,
as it were by direct assault, in this fashion, would find that he had another
Plevna on his hands. But a position that cannot be taken by storm or by sap
may be turned or starved into surrender; and this is how we shall take our
Plevna. Be the equations of condition linearly independent or not, it is
obvious that we must have $D(w : i, j)$ equal to or greater than $\Delta(w : i, j)$. I
shall show by aid of a construction drawn from the resources of the Imaginative
Reason, and founded on the reciprocal properties that have just been
exhibited by the famous $O$ and $\Omega$, that this latter supposition, of the first
member of the equation being greater than the second, is inadmissible and
must be rejected. Observe that $(0 : i, j)$, the number of ways of making up 0
with $j$ of the integers $0, 1, 2, \ldots, i$, is 1; also that $D(0 : i, j)$, the number
of arbitrary constants in the most general differentiant in $x$ to the quantic
$(a, b, c, \ldots, l)(x, y)^i$ of order $j$ and weight 0, is also 1; for such differentiant is
obviously $\lambda a^j$.
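The number of arbitrary constants can also be found by brute force in small cases: write down the equations of condition arising from $\Omega D = 0$ and count how many of the indeterminate coefficients they leave free. The sketch below, in Python with exact rational elimination, does this and sets the result beside $\Delta(w : i, j)$. It is an illustration only; the names `monomials`, `omega_on_monomial`, `rank` and `count_differentiants` are assumptions made here, and a numerical agreement in a handful of cases is of course no substitute for the demonstration.

```python
from fractions import Fraction
from itertools import combinations_with_replacement


def monomials(w, i, j):
    """Exponent tuples of the products of j elements a_0..a_i of total weight w."""
    result = []
    for combo in combinations_with_replacement(range(i + 1), j):
        if sum(combo) == w:
            exps = [0] * (i + 1)
            for k in combo:
                exps[k] += 1
            result.append(tuple(exps))
    return result


def omega_on_monomial(exps, i):
    """Omega = a d/db + 2b d/dc + ... applied to a single monomial."""
    out = {}
    for k in range(1, i + 1):
        if exps[k]:
            e = list(exps)
            c = k * e[k]
            e[k] -= 1
            e[k - 1] += 1
            out[tuple(e)] = out.get(tuple(e), 0) + c
    return out


def rank(rows):
    """Rank of a matrix by Gaussian elimination over the rationals."""
    rows = [[Fraction(x) for x in r] for r in rows]
    rk = 0
    ncols = len(rows[0]) if rows else 0
    for col in range(ncols):
        pivot = next((r for r in range(rk, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[rk], rows[pivot] = rows[pivot], rows[rk]
        for r in range(rk + 1, len(rows)):
            if rows[r][col] != 0:
                f = rows[r][col] / rows[rk][col]
                rows[r] = [x - f * y for x, y in zip(rows[r], rows[rk])]
        rk += 1
    return rk


def count_differentiants(w, i, j):
    """Number of coefficients left arbitrary by the equations Omega D = 0."""
    src = monomials(w, i, j)          # one unknown coefficient per monomial
    dst = monomials(w - 1, i, j)      # one equation of condition per monomial
    index = {m: n for n, m in enumerate(dst)}
    matrix = [[0] * len(src) for _ in dst]
    for col, m in enumerate(src):
        for target, c in omega_on_monomial(m, i).items():
            matrix[index[target]][col] = c
    return len(src) - rank(matrix)


print(count_differentiants(2, 3, 2))  # 1: the single form ac - b^2 of the cubic
```

The comparison is meaningful only for $w$ not exceeding one half of $ij$, which is the range the theorem contemplates.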


Thus we have for all values of $w$,

$$D(w : i, j) = \text{ or } > (w : i, j) - \{(w-1) : i, j\},$$

and also

$$D(0 : i, j) = (0 : i, j);$$

therefore

$$D(w : i, j) + D\{(w-1) : i, j\} + D\{(w-2) : i, j\} + \ldots + D(0 : i, j) = \text{ or } > (w : i, j).$$

If in the above condition, for any assumed value of $w$, $>$ is the sign to be
employed, then the equation $D(w : i, j) = \Delta(w : i, j)$ cannot be satisfied for
all values of $w$. If, on the other hand, $>$ is not the sign to be employed, then
this equation, for every value of $w$, commencing with the assumed one down to
0, must be satisfied. The greatest value of $w$ for given values of $i$, $j$, it is
well known, is $\frac{ij}{2}$ for $ij$ even, and $\frac{ij - 1}{2}$ for $ij$ odd. Let us give to $w$ this
maximum value in the above “greater or equal” relation; for brevity, denote
the differentiants whose types are $[w, i, j]$, $[(w-1), i, j]$, ... by $[w]$, $[w-1]$,
$[w-2]$, &c. respectively, $i$ and $j$ being regarded as constants. It will be
convenient to substitute for the number of arbitrary constants in any of these
differentiants the same number of linearly independent specific values of
them; so that we shall have $D(w : i, j)$ of linearly independent $[w]$’s,
$D\{(w-1) : i, j\}$ of linearly independent $[w-1]$’s, and so on. Now, instead
of $D\{(w-q) : i, j\}$ differentiants $[w-q]$, let us substitute the same number
of the derived forms $O^q . [w-q]$. I shall prove that the quantities (all of the
same weight $w$) thus obtained are linearly independent of one another.


For suppose that those belonging to any one set $O^q . [w-q]$ are not
independent, but are connected by a linear equation. Then, operating upon
this equation with $\Omega^q$, we shall obtain a linear equation between the quantities
$[w-q]$, for each quantity $\Omega^q . O^q . [w-q]$ is a numerical multiple of
$[w-q]$; which is contrary to the hypothesis. Again, let there be a linear
equation between the quantities contained in any number of sets of the form
$O^q . [w-q]$ for which $m$ is the greatest value of $q$. Then, operating upon this
with $\Omega^m$, it is clear that all the quantities in the sets for which $q < m$ will
introduce quantities of the form $\Omega^{m-q}[w-q]$ where $m - q > 0$, and which
consequently vanish. There will be left, therefore, only quantities of the
form $[w-q]$ (with $q = m$), between which a linear equation would exist, contrary
to hypothesis, as in the preceding case. Therefore all the quantities in all the sets
are linearly independent. But these are all of the weight $w$, that is,
$\frac{ij}{2}$ or $\frac{ij - 1}{2}$, and are therefore linear functions of the combinations of the
elements, equal in number to the ways in which the integers $0, 1, 2, 3, \ldots, i$ can be
combined $j$ and $j$ together so as to give the weight $w$. Therefore being linearly
independent, as just proved, their number cannot exceed this last-named number,
that is, cannot exceed $(w : i, j)$. That is to say,

$$D(w : i, j) + D\{(w-1) : i, j\} + \ldots + D(0 : i, j)$$

cannot exceed $(w : i, j)$. Therefore every one of the equations

$$D(w : i, j) = \Delta(w : i, j)$$

must be satisfied from the maximum value of $w$ down to the value 0, which
proves the great hitherto undemonstrated fundamental theorem for a single
quantic.


For any number of quantics the demonstration is precisely similar at
all points: there will be as many systems of $i$, $j$ as there are quantics.
$(w : i, j : i', j' : \ldots)$ will denote the number of ways of making up $w$ with $j$
of the integers $0, 1, 2, \ldots, i$, with $j'$ of the integers $0, 1, 2, \ldots, i'$, and so on.
The theorem to be demonstrated will be

$$D(w : i, j : i', j' : \ldots) = \Delta(w : i, j : i', j' : \ldots).$$

$\Omega$ will become $\Sigma\left(a\frac{d}{db} + 2b\frac{d}{dc} + \ldots\right)$;

$O$ will become $\Sigma\left(i\,b\frac{d}{da} + (i-1)\,c\frac{d}{db} + \ldots\right)$.

It will still be true that $\Omega^q . O^q . D$, where $D$ is a differentiant in $x$ (that is,
a function of the elements in all the given quantics which withstands change
when these are transformed by writing $x + hy$ for $x$), is a numerical multiple
of $D$; and $D$ will be subject to the identity $\Omega D = 0$. We shall still have

$$D(w : i, j : i', j' : \ldots) = \text{ or } > \Delta(w : i, j : i', j' : \ldots)$$

and

$$D(0 : i, j : i', j' : \ldots) = (0 : i, j : i', j' : \ldots),$$

and shall be able in precisely the same way as before to demonstrate the
impossibility of $\Sigma_k D(w - k : i, j : i', j' : \ldots)$ being greater than $(w : i, j : i', j' : \ldots)$,
and so shall be able to infer by the same logical scheme

$$\Delta(w : i, j : i', j' : \ldots) = D(w : i, j : i', j' : \ldots).$$

This is my extension of Professor Cayley’s theorem, which leads direct to the
Generating Fractions given in my recent papers in the Comptes Rendus.


In a series of articles which I hope to publish in the American Journal 
of Pure and Applied Mathematics, I propose to give a systematic develop- 
ment of the Calculus of Invariants, taking a differentiant as the primordial 
germ or unit. I have spoken of a differentiant in $x$, and of course might
have done so equally of a differentiant in $y$. If we call the former $D_x$,
it is capable of being shown, from the very natures of the forms $O$ and
$\Omega$, that if the quantity $ij - 2w$, which may be called the degree of $D_x$,
be called $\delta$, then $O^\delta D_x$ becomes a differentiant in $y$. These may be termed
simple differentiants; but the principle of continuity forbids that we
should omit to comprise in the same scheme the intermediate forms $O^p D_x$
or $\Omega^q D_y$, through which simple differentiants in $x$ and $y$ pass into each other.
These may be termed mixed differentiants; $O^p D_x$ may be termed a differentiant
$p$ removed (as we speak of cousins once, twice, &c. removed) from $x$,
which will be the same thing as $\Omega^q D_y$ (a differentiant $q$ removed from $y$)
if $p + q$ is equal to the degree, namely, $ij - 2w$. Now all these differentiants,
whether simple or mixed, possess a wonderful property, which may be deduced
by means of Salmon’s Theorem, given in the Philosophical Magazine for
August 1877. They are all, in an enlarged sense of the term, Invariants, in
this sense to wit, that if the elements are made to undergo a substitution
consequent upon or, as we may say, induced by a general linear substitution
impressed on the variables, which for greater simplicity of enunciation may be
supposed to have unity for the determinant of its matrix, then every differentiant,
whether single or double (the latter being equivalent to an invariant),
and whether simple or mixed, will remain a Constant Function of the Coefficients
of the impressed substitution. To wit, if the differentiant be $p$
removes from $x$ and $q$ removes from $y$ (so that its degree is $p + q$), and if the
impressed substitution be $lx + \lambda y$ for $x$, and $mx + \mu y$ for $y$, where $l\mu - \lambda m = 1$,
then will the differentiant be a constant bipartite quantic in the two sets
of coefficients $l$, $m$ and $\lambda$, $\mu$, of the degree $q$ in the former and $p$ in the latter,
a theorem which amounts almost to a revolution in the whole sphere of
thought about Invariants.


I have borrowed the term “Imaginative Reason” from a recent paper 
of Mr Pater on Giorgione, in which, as in many of those of Mr Symonds 
(I will instance one on Milton in particular), I find a continued echo of 
my own ideas, and in the latter many of the very formulæ contained in
my Laws of Verse, where versification in sport has been made æsthetic
in earnest. Surely the claim of Mathematics (its “Andersstreben”) to
take a place among the liberal arts must be now admitted as fully made
good. Whether we look to the advances made in modern geometry, in
modern integral calculus, or in modern algebra, in each of these a free
handling of the material employed is now possible, and an almost unlimited
scope left to the regulated play of the fancy. It seems to me that the whole
of æsthetic (so far as at present revealed) may be regarded as a scheme having
four centres, which may be treated as the four apices of a tetrahedron, namely
Epic, Music, Plastic, and Mathematic. There will be found to be a common
plane to every three of these, outside of which lies the fourth; and through
every two may be drawn a common axis opposite to the axis passing through
the remaining two. So far is certain and demonstrable. I think it also
possible that there is a centre of gravity to each set of three, and that the
lines joining each such centre with the outside apex will intersect in a
common point the centre of gravity of the whole body of æsthetic; but what
that centre is or must be I have not had time to think out. 


Postscript.—In the first fervour of a new conception, I fear that in the 
manuscript which is now on its way to England I may have expressed myself 
with some want of clearness or precision on the subject of pure and mixed
differentiants. I will therefore add a few more explanatory and vaticinatory
words on this subject, through the medium of which I catch a glimpse of the
possibility of obtaining a simple proof of Gordan’s theorem, just as through
the medium of pure differentiants taken per se I caught a glimpse (almost
immediately afterwards to be converted into a certainty) of the proof of
Cayley’s theorem given in this memoir. I conceive that what the ensemble
of pure differentiants have done for the one, the larger ensemble of all sorts of
differentiants, pure and mixed, taken together, will enable me or some one
else to accomplish for the other. 


Any function of the coefficients of a quantic which is nullified by the
operation upon it of $\Omega$, which we may call the revector symbol, or in other
words, whose first revect is zero, is a pure differentiant in $x$. So, of course, if
nullified by the operation upon it of $O$, which may be called the provector
symbol, it is a pure differentiant in $y$. We may call $ij - 2w$, where $i$ is the
degree of the quantic, $j$ the order of a pure differentiant, and $w$ its weight in
$x$, the grade of the differentiant, and denote this grade by $\delta$.


The $\delta$th provect of a pure differentiant in $x$ is of course a pure differentiant
in $y$, which is $\delta$ removes from $x$, as the pure differentiant in $x$ is $\delta$ removes
from $y$. If $q$ be less than $\delta$, the $q$th provect of a pure differentiant in $x$
is a mixed differentiant $q$ removes from $x$, or, if we like to say so, $(\delta - q)$
removes from $y$. The grade of a mixed differentiant may be defined to
be the same as that of the pure differentiant of which it is a provect or
revect.
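The passage from a pure differentiant in $x$ to one in $y$ through successive provects can be followed on a small example. The sketch below is a minimal illustration in Python; the helper names `shift`, `omega`, `big_o` and `power` are hypothetical, and the form $D = ac - b^2$ of the cubic (grade $\delta = 2$) is an example chosen here, not one taken from the text.

```python
def shift(poly, moves):
    """Apply  sum of c * a_dst * d/da_src  over (c, src, dst) in moves.
    Polynomials are dictionaries {exponent tuple: coefficient}."""
    out = {}
    for exps, coef in poly.items():
        for c, src, dst in moves:
            if exps[src] == 0:
                continue
            e = list(exps)
            factor = coef * c * e[src]
            e[src] -= 1
            e[dst] += 1
            out[tuple(e)] = out.get(tuple(e), 0) + factor
    return {m: v for m, v in out.items() if v != 0}


def omega(poly, i):
    """Revector symbol: Omega = a d/db + 2b d/dc + ..."""
    return shift(poly, [(k, k, k - 1) for k in range(1, i + 1)])


def big_o(poly, i):
    """Provector symbol: O = i b d/da + (i-1) c d/db + ..."""
    return shift(poly, [(i - k, k, k + 1) for k in range(i)])


def power(op, poly, i, q):
    """The q-th provect (op = big_o) or revect (op = omega) of poly."""
    for _ in range(q):
        poly = op(poly, i)
    return poly


i, grade = 3, 2
D = {(1, 0, 1, 0): 1, (0, 2, 0, 0): -1}   # ac - b^2, a pure differentiant in x
first = power(big_o, D, i, 1)             # ad - bc, a mixed differentiant
last = power(big_o, D, i, grade)          # 2bd - 2c^2, the grade-th provect

print(omega(D, i) == {})                  # first revect of D is zero
print(omega(first, i) != {} and big_o(first, i) != {})
print(big_o(last, i) == {})               # nullified by O: pure in y
```

The intermediate form $ad - bc$ is nullified by neither symbol, being one remove from $x$ and one remove from $y$.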


Then, in the first place, we have this proposition :—If any linear substitu- 
tion whatever be impressed on the variables of a quantic, the transformed value 
of any of its differentiants will separate into two factors, of which one will be 
the determinant of substitution raised to the power w, where w is the weight 
corresponding to the order and grade of the differentiant and the degree 
of the quantic. The remaining factor will be a function of the coefficients of 
substitution, and may be called the outstanding factor. Of this I shall
proceed to speak.

Let $x$ be replaced by $hx + ly$, and $y$ by $kx + my$.


Then the outstanding factor for the transformed $D$ (a pure differentiant in $x$
of the grade $\delta$) may be proved by repeated applications of Salmon’s theorem
to be equal to

$$\left(1 + \frac{k}{m}\,O + \left(\frac{k}{m}\right)^2 \frac{O^2}{1 . 2} + \ldots\right) m^\delta D,$$

where of course the series of terms in the development will, after the $(\delta + 1)$th
term, vanish spontaneously. In other words, the outstanding factor of the
transformed $D$ is $m^\delta e^{\frac{kO}{m}} . D$, where it will be noticed that only the coefficients
of substitution due to the change in $y$ make their appearance.


If now we take any mixed differentiant, say the $q$th provect of $D$, that is,
$O^q . D$, its outstanding factor, I find, will be the $q$th emanent of the outstanding
factor for $D$, that is, will be

$$\left(h\frac{d}{dk} + l\frac{d}{dm}\right)^q \left(m^\delta e^{\frac{kO}{m}}\right) D.$$

And here for the present I end. The subject is, as it was, a vast one;
and this conception of mixed differentiants opens out still vaster horizons. 
Every thing grown on American soil, or born under the influence of its skies, 
as its lakes, its rivers, its trees, and its political system, seems to have a 
tendency to rise to colossal proportions. 


I will merely add one remark which has occurred to me relating to
Sturm’s theorem and the process of Algebraical common measure in general.
If $f(x, y)$ be a rational integral function of $x$, $y$, and $f'(x, y)$ its derivative in
respect to $x$, and we perform the process of common measure between them
regarded as functions of $x$, we know that the irreducible part of the successive
remainders taken in ascending order, say $U_0, U_1, U_2, \ldots$, will have for their
leading coefficients (say $D_0, D_1, D_2, \ldots$) the discriminants of $f$ and of its
successive derivatives in respect to $x$ respectively.

Here $D_0$ is an invariant of the given form;

$D_1$ (a differentiant in $x$) will be the leading coefficient of the covariant

$$D_1 x^2 + O . D_1\,xy + \frac{O^2}{1 . 2} D_1\,y^2;$$

$D_2$ (another differentiant in $x$) will be the leading coefficient of the
covariant

$$D_2 x^4 + O . D_2\,x^3 y + \frac{O^2}{1 . 2} D_2\,x^2 y^2 + \frac{O^3}{1 . 2 . 3} D_2\,x y^3 + \frac{O^4}{1 . 2 . 3 . 4} D_2\,y^4,$$

and so on until we come back to the first Sturmian remainder of $f(x, y)$, the
irreducible part of which (or we may call it the Sturmian Auxiliary Proper)
is the Hessian differentiated down from being of the degree $2i - 4$ to the
degree $i - 2$, that is, to half of what it was at first; and so in like manner
every Sturmian Auxiliary Proper is, so to say, a Covariant differentiated down
to half its original dimensions.

The above invariant and the following covariants may be called $V_0$, $V_1$,
$V_2$, ... respectively. The interesting point in question is that (up to numerical
factors)

$$U_0 = V_0, \quad U_1 = \frac{d}{dx} V_1, \quad U_2 = \left(\frac{d}{dx}\right)^2 V_2, \quad U_3 = \left(\frac{d}{dx}\right)^3 V_3,$$

and so on.

So more generally for any two functions $f(x, y)$, $\phi(x, y)$, the irreducible
part of the remainders obtained by common-measuring them with respect to
$x$ will all be derivatives in regard to $x$ of covariants of the two given quantics.
If we take for our quantics

$$(a, b, c, \ldots, h, k, l)(x, y)^i, \qquad (a', b', c', \ldots, h', k', l')(x, y)^{i'},$$

the covariants in question will all be educts of (that is, functions having for 
their leading coefficients) the successive resultants of the forms 


ia. hb Dye e, Bovy], 
of the forms KasA yal, ak es 
of the forms Ce ae (rae gp 


and so on, the discriminants of which may be called partial resultants of the 
given forms; in a word, the simplified residues arising in the process of 
common-measuring in respect to one of their variables two given binary 
quantics are differential derivatives, in respect to that variable, of the educts 
of their partial resultants (of course with the understanding that the last 
simplified residue is the complete resultant itself). 


This seems to point to the existence of some generalized statement of 
Sturm’s theorem in which the same Educts as above referred to shall appear, 
but where, instead of their derivatives in respect to one of the variables being 
made use of, perfectly general Emanants of them shall be employed as the 
Criterion functions. For I need hardly add that all Educts (although not 
written so as to show it in what precedes) are in fact symmetrical in respect 
to the two sides of the quantic to which they belong. 


On various à priori grounds I suspect the generalized theorem to be as
follows. If $X_\mu$ is the covariant (of degree $2\mu$) whose $\mu$th derivative in respect
to $x$ is a Sturmian Auxiliary Proper to $F(x, y)$, we may substitute throughout
for all the values of $\mu$, instead of each such derivative, the more general one
$\left(f\frac{d}{dx} - g\frac{d}{dy}\right)^\mu X_\mu$, where $f$ and $g$ are any assumed positive constants, of course
with the understanding that the second criterion also is to be $\left(f\frac{d}{dx} - g\frac{d}{dy}\right) F$
in lieu of $\frac{dF}{dx}$. And the method of Sturm will still be applicable for finding
the positions of the real roots of $\frac{x}{y}$ in $F(x, y) = 0$ when we use these more
general derivatives as the criteria instead of Sturm’s own. When $g = 0$ the
theorem is that of Sturm; when $f = 0$ it is an immediate deduction from this
theorem applied to finding the positions of the root values of $\frac{y}{x}$, when it is
borne in mind that the motions of $\frac{x}{y}$ and of $\frac{y}{x}$, as regards ascent and descent
(excluding the moment for which either of these ratios is indefinitely near to
zero) are inverse to each other. It is this that accounts for the negative sign
which precedes $g$.


It is difficult to conceive by what theorem other than the assumed one the 
chasm between those extreme cases can be bridged over; and all analogy and 
all belief in continuity veto the supposition that no such bridge exists. 
“Divide et impera” is as true in algebra as in statecraft; but no less true and 
even more fertile is the maxim “auge et impera.” The more to do or to 
prove, the easier the doing or the proof. 